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Abstract: Hue-Saturation-Intensity (HSI) color model, a psychologically appealing color model, was employed to visualize uncertainty 
represented by relative prediction error based on the case of spatial prediction of pH of topsoil in the peri-urban Beijing. A two-dimensional 
legend was designed to accompany the visualization—vertical axis (hues) for visualizing the predicted values and horizontal axis (whiteness) 
for visualizing the prediction error. Moreover, different ways of visualizing uncertainty were briefly reviewed in this paper. This case study 
indicated that visualization of both predictions and prediction uncertainty offered a possibility to enhance visual exploration of the data un¬ 
certainty and to compare different prediction methods or predictions of totally different variables. The whitish region of the visualization 
map can be simply interpreted as unsatisfactory prediction results, where may need additional samples or more suitable prediction models 
for a better prediction results. 
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Introduction 

Since 1980s, mathematical model has been broadly applied in 
spatial predictions of soil and other landscape attributes. Soil is a 
very complicated non-linear dynamics system on the earth sur¬ 
face. Any model can not absolutely, accurately describe and 
reflect soil variability in the real world. Therefore uncertainty is 
always being in spatial prediction results. Spatial uncertainty has 
two connotations, namely attribute uncertainty and spatial loca¬ 
tion uncertainty, existing in all links, for example, datum meas¬ 
urement, statistical calculation and construction of model. Fur¬ 
thermore, uncertainty of original datum and model can be 
propagated through spatial predicted model, inducing uncertainty 
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of final output results and the related error appraisal (Mowrer 
and Congalton 2000). Since 1990s, it has been attached more 
importance to assessment of the uncertainty in spatial prediction 
of soil properties, and this assessment has being brought into 
subsequent decision-making processes, such as delineation of 
polluted areas or identification of zones that are suitable for crop 
growth (Goovaerts 2001). The logical standard judging quality of 
soil spatial predicted map is the content of information that con¬ 
sists of information content of soil properties, uncertainty analy¬ 
sis, and information recognition potential. Among the above 
information, the visualization of uncertainty is one of key factors 
to determine quality of soil spatial predicted map. With the de¬ 
veloping of GIS technology, visualization of uncertainty is be¬ 
coming a vital new connotation of soil mapping process. Visu¬ 
alization of uncertainty can be divided into two different ways: 1) 
independent visualization of uncertainty, as an independent ad¬ 
jacent map for spatial prediction map; 2) synchronous visualiza¬ 
tion, combined with uncertainty analysis and spatial predicted 
results on the same map. 

Until the mid-1990s, uncertainty assessment has been essen¬ 
tially performed using non-linear kriging approaches that aim at 
evaluating the probability for the target attribute greater than the 
threshold value and setting confidence range at a specific un¬ 
monitored location (Smith et al. 1993; Webster and Oliver 1989; 
Goovaerts and Journel 1995; Li et al. 2004). Synchronous visu¬ 
alization is always implemented through two ways. One new 
layer with the uncertainty information is established on the spa¬ 
tial prediction map, which changes information of uncertainty 
mainly through transparency, haze or fog, blurring and texture 
and patterns under spatial prediction map (Dutton 1992; 
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Maceachren 1992; Monmonier 1990; Pang et al. 1994). The 
other constructs spatial prediction map with uncertainty informa¬ 
tion through improvement of mapping technology combining the 
uncertainty information with predicted results in the same layer. 
Hengl et al. (2004) used HSI color model to visualize both the 
spatial prediction of topsoil thickness and uncertainty analysis in 
the same layer for the first time. However, the related researches 
in the above field are scarce in China. This paper uses HSI color 
model to visualize spatial predicted result associated with uncer¬ 
tainty expressed by relative prediction error on the same map, 
based on a case study of spatial prediction of topsoil pH in Bei¬ 
jing Peri-urban. 

Materials and methods 

Sample collection and analysis 

The study area is located within the peri-urban zone of Beijing 
(about 2 600 km 2 ). In total of 220 topsoil samples (0-20 cm) 
were collected from the grid cells about 2 km x 2 km from April 
to May of 2001. Each point in Fig. 1 (enlarged diagram on the 
right side) represents a 5 m x 5 m block where five soil samples 
were collected from the center and corners to make one compos¬ 
ite sample. The global positioning system (GPS) method was 
used to record geographical coordinates of each sampled site, 
and environmental observations were described during the 
fieldwork. The values of pH were measured by electrode method 
in liquor mixed water with soil by 2.5:1. 

Geo-statistical method 

The semi-variance function of sampling point was calculated 
based on GS + software. The pH values of un-sampled soil were 
best and un-partially predicted through ordinary Kriging method 
in Geo-statistical method by ArcGIS software, and prediction 
map and standard predicted error map were derived through the 
software. 



Fig. I Distribution 
o f soil-sampling 
sites in Study area 


The HSI color model 

The first important requirement for successful visualization of 
uncertainty is to select a psychologically appealing color model. 
For example, a logical color variable for visualizing uncertainty 
is whiteness. HSI model seems to be the most promising for the 
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visualization of uncertainty. This was confirmed by Jiang (1996) 
who conducted a number of perception tests. Another important 
reason for using this color model is the fact that it is geometri¬ 
cally related to the RGB color cube, which makes it possible to 
calculate intermediate and mixed colors. Whiteness is amount of 
the white color, which in the color cube can be visualized as the 
shortest distance to the white comer of the RGB cube. The hue 
(H) represents the visual sensation of the color type, and is cal¬ 
culated as the number of degrees around the axis. The saturation 
(S) represents the degree to which the color expresses its hue, 
and is calculated as the radial distance from the diagonal axis. 
The intensity (I) represents the visual sensation of brightness 
(Fig. 2a). 

Visualization of uncertainty 


The uncertainty of prediction is represented with the normalized 
or relative prediction error expressed in percentage, which is to 
divide the prediction error of the transformed variable by the 
standard deviation of observed samples. 

&E r( S o) = —Xl00% (1) 


where, <T £ (s 0 ) is the prediction error, S z is the standard 

deviation of observed samples. A satisfactory prediction is when 
the model explains more than 85% of the total variation. As a 
mle of thumb (Jiang 1996; Hengl 2003; Park and Vlek 2002), we 

can consider that a value of <7 E (s 0 ) close to 40% means a 

fairly satisfactory accuracy of prediction. Otherwise, if the values 
get above 80%, the model accounted for less than 50% of vari¬ 
ability at the validation points and prediction is unsatisfactory. 
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Fig. 2 Design of the legends for visualization of uncertainty, (a) 
Flue-Saturation-Intensity color model and the Red-Cireen-rilue 
color cube; (b) the two-dimensional legend used for visualization of 
uncertainty in quantitative data 
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The values of prediction and relative prediction error must be 
standardized by maximal and minimal value before coding by 
using: 


S = (l-M r )'240 

7 = (l + wJ-120 
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U 2 U y 



where, Z r is the standard prediction, U r is the standard pre¬ 
diction uncertainty (Z r , U r e[0, 1]). Z is the prediction map, 
°E,r is the relative prediction error map standardized using 

Eq.(l), Zi and z 2 , ui and u 2 are the lower and upper limits of the 
predicted values and relative prediction error. For prediction 
variables, the range limits are the measured minimum and 
maximum, and for uncertainty the thresholds are 40% and 100%. 

The map of predictions and uncertainty can be visualized si¬ 
multaneously by coding the predictions with hues and uncer¬ 
tainty with whiteness. The predictions are coded to hue using: 


Design of the legend 

Two-dimensional legend was designed for prediction of con¬ 
tinuous variables. Here, vertical axis represented the predicted 
values and horizontal axis (whiteness) is used to visualize the 
prediction error (Fig. 2b). On the vertical axis, hues ranged from 
blue for low values (-90°) to red for high values (-30°). A part of 
the hue circle representing magenta (-30°to -90°) has been omit¬ 
ted to avoid confusion between high and low values. On the 
horizontal axis, the saturation-intensity changes linearly from 
low to total whiteness. The combined visualization gives insight 
into the relationship between uncertainty and input data for the 
given thresholds. Full color (the original RGB) means the mini¬ 
mum uncertainty is equal to or less than 40%, and full whiteness 
indicates the maximum uncertainty equal 100%. 


<p x = -90 + z r x-300 



fft+360 if (p x <-360 
1ft if ft >-360 



where, (p x is the hue angle in degrees measured clockwise, 

ft is the value transformed from the -360 to 360 range. The 
HSI-coded image is then derived using: 


Results and discussion 

Descriptive statistics 

Descriptive statistics results showed that the variability of pH 
was quite small. Distribution of the data was accordance with 
normal distribution using K-S test (K-S, 0.362). The mean value 
was 8.37 (the minimum was 7.35; the max. 9.06), therefore the 
soil was alkaline. 

Geo-statistical analysis 


H = (<p 2 +360)- 


240 

360 


Experimental Isotropic semi-variogram of topsoil pH at full ex¬ 
tent was matched with exponential model best, R 2 was 0.935. 
Effective range was 18.99 km, Nugget was 50.2%, demonstrat¬ 
ing that pH had a moderate spatial correlation in a relative large 
range. 
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Fig. 3 Prediction map of topsoil pH (a), relative prediction error (uncertainty) (b). visualization of both predictions and prediction 
uncertainty of pH for topsoil, which accompanying two-dimensional legend—predictions with hue. uncertainty with whiteness (c) 
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Visualization of predictions and uncertainty 

Spatial prediction map of pH was derived by ordinary Kriging 
method in ArcGIS software. The results of cross validation 
showed that mean standardized error was -0.0022, 
root-mean-square error (0.1811) nearly equaled to average stan¬ 
dard error (0.1736). The value of root-mean-square standardized 
error approximately equaled to 1. The mean error was -0.0005. 
All data demonstrated that the precision of prediction was quite 
perfect. 

The spatial prediction map of pH presented patch shape. High 
values were mainly distributed in the west-south corner of the 
study area. Outside of the west-south comer, the value was de¬ 
creasing. Low values were distributed in the west-north corner 
(Fig. 3a). Uncertainty map showed that relative prediction error 
was increasing far from the sample points. In the center and four 
comers, the uncertainty was relatively higher, of which the high¬ 
est value was 92% (Fig. 3b). Most area was whitish (Fig. 3c), 
meant that the spatial prediction was not satisfactory. The visu¬ 
alization resulted in most of the study area far from the points 
being pale. These areas would probably need additional samples 
or more suitable prediction models. This study used different 
kriging methods to do the uncertainty analysis and the results 
were nearly the same, the uncertainty was all very high. In case 
of significance was very little, this paper did not list other results. 
If the interpolated data would be perfect, it is very interesting to 
do this. 

Conclusions 

The results of cross validation showed that precision of spatial 
prediction was relatively high. But the uncertainty was also quite 
high, which the mean relative prediction error was about 88%. 
Prediction standard error was influenced by sampling density and 
distribution pattern. The relative prediction error, i.e., the uncer¬ 
tainty will be smaller, and the spatial prediction will be more 
credible, if the sampling density is denser and the distribution is 
more regular. In this case the samples distributed uniformity, but 
the sampling space was large, and relative prediction error was 
quite high. Therefore it is an effective way to improve the sam¬ 
pling density to reduce the prediction uncertainty. 

Visualization of both predictions and prediction uncertainty 
offers a possibility to enhance visual exploration of the data un¬ 
certainty and to compare different prediction methods or predic¬ 
tions of total variables. The whitish area can be simply inter¬ 
preted as unsatisfactory predictions, i.e. areas that need to be 
sampled additionally. 

In the process of visualization both prediction and prediction 
uncertainty at the same time, the two dimensional legends 
showed non-neglected limiting effect. The matter is that it is not 
easy to match the colors on the HSI coded image with the legend 
because the pale colors are harder to be distinguished. Moreover, 


it is not easy to read the actual uncertainty within ±5% of the 
relative prediction error by just looking at the whiteness on the 
map. Therefore, further effort should be focused on improvement 
of the legend designed. 

Analysis and visualization of uncertainty don’t aim at elimi¬ 
nating uncertainty in spatial prediction, but using existing infor¬ 
mation to assess the accuracy and precision of uncertainty mod¬ 
els, and then visualize it straightly. It is very important to make 
actual decisions based on the prediction results. Take example of 
this case, the spatial prediction uncertainty is high, therefore it 
can’t guide agriculture production and administer soil resource 
for fear acidification. 
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